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The present invention relates to the field of 
multimedia communications, and more particularly a 
process and apparatus therefor for videoconf erences 
that provides link-ups among several attendants and 
5 with extremely variable characteristics and modalities, 
adaptable to any specific need of the user. 

Currently, multiple user videoconf erence apparati 
and techniques are known, and despite being based on 
different execution parameters, they make the choice of 
10 the image to be shown to the attendants on the grounds 
of the audio signal coming from the attendants 
themselves, which is technically called "audio 
presence". < 

In other words, /the sound received by the 
microphone located at yfevery equipped location gets to 
the centralised videoconf erence management device. 
This device shows all the attendants the image of the 
user that has generated the sound impulse. In such a 
way, all the attendants receive the image of the person 
that is speaking /at that precise moment of time on 
their screns. 1ft is therefore clear that if two or 
more users speafk at the same time, the conference 
management devyce carries out image commutations on a 
continuous bas/is, causing considerable disruptions and 
25 chaos all along the course of the videoconf erence 
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itself. 

Attention is also drawn to the fact that a user is 
allowed into a dialogue which has already started, even 
because of a background noise from his own environment, 
5 which could be completely independent from his will but 
is detected by the microphone located at his place. 

Currently, in order to resolve such problems, it 
is necessary to turn off one's own microphone (but this 
risks turning an interesting debate into an endless 

10 monologue) . 

This type of automatic commutation caused by the 
audio presence, necessarily requires the presence of an 
interpreter next to each single attendant, in case of 
videoconf erences that involve people speaking different 

15 languages. 

Besides this, current technology does not always 
provide carrying out a link-up between different 
videocommunication systems. The apparati which are 
currently being used in fact only allow file 
20 transmission and/or sharing just in case the link-up 
devices of the several attendants are made by the same 
manufacturer, in so doing drastically limiting the 
possibilities of employment of the system itself (file 
sharing, transmission and transfer, etc.). 
25 A further problem of the prior art is given by the 

fact that the possibility of executing fadings among 
the images of the speakers that make their 
contributions along the way, and possible audio-video 
contributions, whether they be films, photographs, 
30 static images, graphs and so on, is ruled out. 


2 


A further disadvantage of the currently viable 
videoconf erences, is given by the fact that it is not 
possible to superimpose titles, subtitles, 

abbreviations, speakers' names, musical themes and 
soundtracks, and all audio and video effects that can 
make of a "flat" and static videoconf erence a real 
television programme . 

In this, respect, it is useful to observe that said 
problems and drawbacks have not only got purely 
aesthetic consequences, but they also cause a rapid 
decrease in the level of attention of the attendants, 
which is an extremely important factor for the success 
of a conference of whatever type. 

It is also Known, from EP-A-061967 9, a multi- 
location television conference system that connects 
five locations A, B, C, D, and E, when speeches take 
place at the four locations A, B, C, and D at the same 
time, at a listening location E, images of all the 
speaking: locations A, B, C, and D are displayed on one 
screen with four divided screen areas. On the other 
hand, at the speaking location A, images of the 
speaking locations B, C f and D and an image of the 
former speaking location E are displayed on one screen 
with four divided screen areas. In addition, when 
images of speaking locations are displayed, locations 
names thereof are also displayed. Thus, a television 
conference held at a plurality of locations at a time 
can be smoothly managed as with a real conventional 
conference . 

A first disadvantage of this conference system is 
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that it does not allow the connection among systems 
having different transmission protocols, different type 
of signals or different technologies . 

A second disadvantage of EP-A-0619679 is that said 
limitations prevent the simultaneous connection and use 
of quite different transmission channels such as 
satellite, computer network, telephone lines, internet, 
and so on. 

A third disadvantage of EP-A-0619679, is that the 
information that can be displayed on the screen of each 
user, by superimposition with the images of the most 
recent speaker/s, are very limited and require the use 
and the creation of identification codes. 

A fourth disadvantage of EP-A-0619679 is the i it 
does not jzfxovide means for substitute the audio signal 
of one jot more user with the audio signal coming from a 
simuixaneous-translation room that translate, in real- 
time, the discourse of the speaker in that of the user. 

Another disadvantage of the system disclosed in 
EP-A-0619679 is that the switching of the images 
displayed is submitted to the detection of an audio 
signal . >, 

A first aim of the present invention is that of 
allowing the course of videoconf erences (congresses, 
debates, presentations, lectures, etc.) with the 
utilization of audiovisual contributions such as films, 
slides, photographs, animated computer aided design, 
graphs, music and/or soundtracks etc. 

A second aim of the present invention is that of 
guaranteeing an orderly and fluent course of a 


videoconf erence, thanks to the audio-visual 

commutations carried out by the operators of the 
direction room, and by the possible presence of a 
chairperson, who is meant to allow the user to 
personally take part in the debate, only at the most 
suitable moment . 

A third aim of the present invention is that of 
giving the possibility to attend a conference even to 
Internet users. Furthermore, through a series of 
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procedures and suitable links, which will be analysed 
in detail in the foregoing, giving the possibility to 
any single spectator who is suitably equipped to 
directly enter and take part in the conference, 
contributing to it with his own image and his own audio 
(even if not originally scheduled) . 

A fourth aim of the present invention is that of 
guaranteeing compatibility between different 

videocommunication systems, utilising the most suitable 
interfaces and transforming the ensemble of the 
videoconference into many point-point links (user- 
direction) with personalised characteristics and 
communication protocols. 

To this purpose attention is drawn to the fact 
that attendants, whether they be interactive or not, 
can be both remote and local and numberwise limitless. 

These and other aims have been accomplished 
according to the invention, by proposing a process and 
an apparatus for the production and management of 
videoconferences, wherein audiovisual signals coming 
from a plurality of remote and/or neighbour locations, 
are acquired and elaborated by a direction room capable 
of dealing with and selecting both the audio and the 
video signal, adding audiovisual contributions like 
television effects, partial or total image 
superimposition, insertion of graphs, tables, films or 
soundtracks, audio commentaries, and so on. 

According to the process and the apparatus which 
are herein described, it is also possible to provide a, 
centralised interpretation service, discriminating on 
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the audio supplied by the users as a function of their 
language . 

A better understanding of the present invention 
. will be gained thanks to the following detailed 
5 description with reference to the appended drawings, 
which schematically illustrate a preferred embodiment 
of the invention. 

In the drawings: 

Fig. 1 schematically illustrates the parts making 
10 up the direction room according to the present 
invention ; 

Fig. 2 is a scheme illustrating the modalities and 
possibilities of link-up between the direction room and 
remote and/or neighbour users, by use . of telephone 
15 lines, via satellite ,- via Internet, and so on. 

Wi eference to the abovementioned figures, the 

process object of the present invention comprises the 
following stages: 

-link-up in a direction room 1 with a plurality of 
20 remote and/or neighbour locations 2, which generate an 
audio video signal 7W; 

-conversion, if necessary, of the audiovisual 
signal AV from every location, before its transfer from 
the place where it is generated to that where the 
25 direction room 1 is located, to adapt it to the type of 
connection and transmission which is employed; 

-reconversion of the received signal, if 
necessary, to an audio- video format, before its 
entrance to direction room 1; 
30 -selection of the signal/s to be. used and sent, 
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respectively, to the attendants and the speakers by an 
entrance audio-video matrix MV1; 

-addition of the necessary audiovideo 
contributions and effects, as well as of titles, 
5 soundtracks, commentaries, graphs, and so on, by mixer 
video MIX or computer with analogous functions: 

-selections of the processed audiovideo signals 
and their forwarding to the several remote locations 2, 
as a function of the fact that at that moment they are 
10 attendants or speakers. 

According to a particular aspect of the process 
described above, while the attendants receive the audio 
video signal from the speaker, the latter will be 
capable of receiving a different audio signal which has 
15 been selected by the direction room. 

For example, the speaker will be capable of 
receiving an overview of all the attendants or of some 
of them, just by using a device that selects the 
desired signals from the signals AV of the several 
20 locations and forwards them to the output audio video 
matrix for the following forwarding to the speaker. 

Moreover, the speaker might have a graph that he 
is commenting to the attendants on his own screen, and 
these are bound to receive it full screen whilst seeing 
25 the image of the speaker himself superimposed or 
occupying a portion of the screen itself. 

A second advantageous aspect of the present 
invention is that it is possible to send audio signal 
A coming from the speaker to an interpretation room I, 
30 wherein a simultaneous translation is carried out into 
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the languages required by the attendants. 

The signal that is sent to each attendant 
therefore consists of the video signal (VI, V2, . . . , Vn) 
ad hoc selected for him, to which a suitable audio 
5 signal has been associated (Al, A2, . ..,An), therefore 
corresponding to the translation required by the user- 
It is obvious that more than one user can receive the 
same audio video signal AV. 

Advantageously, according to the process that is 
10 herein described, it is also possible to record the 
audio video signal for an archive, just as it is 
watched by the attendants, that is with the audiovisual 
contributions and the television effects that have been 
added. 

15 In so far as yche apparatus apt to carry out the 

. process so far described is concerned, within it there 
may substantially be envisaged a plurality of user- 
locations 2 /fig. 1), which are remote and/or local, and 
of the multimedial or interactive type, possibly 

20 equipped/ with a codif ier /decodif ier , otherwise called 
CODEC, /with an aggregator that transforms the analog 
audicwideo signal AV into a digital signal, and linked 
to ar direction room 1 that exchanges a signal AV of the 
analog or digital audio visual type. 

25 Said signal AV contains a bunch of information 

relative to the conference and the speaker or the 
speakers that are given the right to speak from time to 
time , as well as other auxiliary audiovisual 
information. 

30 Said user-locations 2 comprise audio visual 
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input /output means, such as for example computer or 
multimediai stations, tie-line linked-up locations, 
while the signal transmission between said locations 
and the direction room, and vice-versa, can take place 
regardless through (analog ond/or ISDN) telephone 
lines, which can themselves be aggregate or not, 
satellite transmission appliances, data transmission 
networks (including Internet), and so on. 

The signal from each remote location 2, whether it 
be digital or analog, is converted into an audiovideo 
signal, while afterwards it is sent to an audio video 
matrix MV1 which deals . with all the signals and gives 
one or more output signals . 

From a strictly practical point of view, direction 
room 1 simultaneously receives signals AV from all 
users 2 connected to the video conference, and it 
further, controls the audiovideo synchronism in each 
single channel and, if necessary, it suitably modifies 
it (any possible lacks of alignment can be generated by 
several components: transmission, channel aggregation, 
reconversion) . 

Signals AV coming from locations 2 are each 
visualised by a number of monitors and they are 
forwarded to audiovideo matrix MV1 . 

The signals which have been selected are sent to a 
video mixer MIX, or computer with analogous functions, 
which is apt to act as an interface with a series of 
appliances like Personal Computers PC, Videotape 
recorders VD1, cameras, titlers T, audio equipment, and 
so on . 
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According to a peculiar aspect of the present 
invention, the employment of such video mixer MIX 
advantageously provides the addition to or the 
superimpositioh onto the videoconf erence signal, that 

5 is the signal coming from the speaker, a series of 
audiovisual contributions such as titles, subtitles, 
musical themes, soundtracks, audio and video fadings, 
slides and/or graphs. 

Furthermore it is possible to visualise the name 

10 of the speaker that is talking in a certain definite 
moment, to carry out image superimpositions , to utilise 
and apply special effects and/or whatever other 
audiovisual contribution that makes the videoconf erence 
more versatile and adaptable to the needs of a specific 

15 - moment - 

This means that it is also possible to 
superimpose, back up with or create effects between the 
image of the speaker and films that support his talk, 
or graphs that he is creating himself and/or changing 
20 in that moment, and so on. 

Advantageously, during a certain videoconf erence 
this makes it possible to emphasise moments of 
particular interest, and furthermore to underline 
relevant data during' the talk, to highlight the aims to 
25 accomplish and/or particularly relevant news for the 
.topic which is being dealt with. 

Thus, the audio video signal which has been 
elaborated by the video mixer MIX or by a computer with 
analogous functions, is forwarded to a second 
30 audiovideo matrix MV2 and finally to a videotape 
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recorder VD2 which records the videoconference. 

This second audiovideo matrix MV2 , or visual 
signal sorting-out device, supplies the audio-video 
signals to be sent to each single user 2, whether they 

5 be remote or local . 

The two input and output commutation devices of 
the direction room (audio video matrices MV1, MV2 or 
analogous devices) ensure a total compatibility between 
different videocommunication systems, through said 

10 plurality of CODEC or specific interfaces, so as to 
make it possible to carry out transmissions involving 
apparati with technological features that made them 
incompatible so far. Moreover, it is possible to use 
just one video -matrix, if this is believed necessary by. 

15 the direction room, In lieu of the two abovementioned 
ones. 

As previosly properly highlighted, another 
peculiar feature of the present invention is given by 
the fact that it is possible to capture audio signal A 
20 before it reaches output audio video matrix MV2, so as 
to make it possible to have a simultaneous translation 
by one or more interpreters into the language or 
languages used by the attendants if these explicitly 
showed a need for it or if they made a clear request to 
25 the organisation. 

In other words, audio signal A that is sent into 
interpretation room I for translation is then 
associated to video signal V at the output of the 
second audiovideo matrix MV2 in real time, in such a 
30 way that the translation or the translations are 
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listened to by all the attendants that requested to be 
supplied with such a service. 

Advantageously, according to the invention, 
direction i can intervene at any moment by using 
5 audiovideo matrices, substituting audiovideo signal AV 
which is forwarded to one or more remote or local 
attendants 2 with audio video signal AVR, accomplishing 
an 'intercom" type communication while the users who 
are not interested keep attending the videoconf erence 

10 without any disruptions or interferences. 

From what explained so far follows that signal AV 
which is elaborated by direction room 1 must be of the 
analog or digital audio video type: therefore the input 
and output signals, i.e directed to and coming from it, 

15 which are not audio video, must be transformed before 
their employment and finally ref transformed at the very 
moment when they are to be sent to remote attendants in 
the analog or digital form. 

These two input • and output conversions at the 

20 direction room, depend on the features of the link-up 
.with the remote users, once again categorisable as 
digital or analogue, which can be carried out by means 
that the user believes more suitable: analogue, ISDN or 
aggregate ISDN telephone lines, satellite transmission, 

25 computer networks (such as Internet for example), and 
so on . 

From what described so far, it appears to be 
rather clear that all the - attendants to the 
videconf erence receive the audio video signal from the 
30 person that is speaking. Advantageously though, by 
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doubling all the incoming signals, on the speaker's 
screen there will be found to be shown the attendant to 
whom he is answering directly or with whom he intends 
to engage in a discussion, or in a cyclical fashion, 
that is all the participants to the conference (one by 
one or by groups, resorting to audio video multi-signal 
simultaneous combination devices) . 

To said signal which is forwarded to the speaker 
another signal can be added or substituted, this latter 
having been selected by the direction. 

This is accomplished by a targeted or cyclical 
selection device SR, whose output signal is exclusively 
sent to the user that is at that moment playing the 
role of speaker, or otherwise to a group of users,; this 
is done by resorting to the second audio video matrix 
MV2 and whatever else is believed to be most suitable 
for that purpose by the direction. 

It is useful to observe that a cyclical selection 
can take place at controllable time intervals, by dint 
of a timer-programmer or a computer for example. 

According to another peculiar feature of the 
present invention, the director has the possibility of 
selecting the speaker who is scheduled to talk at that 
moment and who will be shown full screen to all the 
other speakers and/or attendants 2. Together with 
that, it is also possible to keep the audio channel of 
all or part of the attendants 2 active, enabling the 
apparatus to automatically visualise the participants 
that take part briefly and temporarily, in the form of 
windows or pointers (spots) suitably placed on the 
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screen . 

Another extremely advantageous aspect of the 
present invention is the possibility of transmitting 
the- videoconf erence via Internet. By suitable 

5 (aggregate or tie-line) connections between the 
direction room and- the Internet provider, it is 
possible to broadcast the audio video signal. AV of the 
videoconf erence, that comes from the audio-video output 
matrix MV2 , and whatever Internet user. 

10 Furthermore, by a suitable discussion group, each 

single user can ask questions, show examples and 
actively take part in the debate. 

The chairperson or the person in charge of the 
videoconf erence will be capable of visualising all the 

15 communications of the final users or attendants, by a 
computer PCM connected to the same discussion group. 

He will be capable of ascertaining whether they 
are worth being addressed to one of the speakers that 
will be then able to answer through the channels and 

20 the already described modalities of the 
videoconf erence . 

If on the other hand the chairman will believe it 
suitable to personally let the Internet user UI 
contribute to the videoconf erence , direction room 1 is 

25 capable of carrying out an unexpected but nonetheless 
possible telephone link-up AV-UI, turning the Internet 
user UI into an actor from spectator as he was, 
offering him a chance to come and take part in the 
conference just in the same manner as that given to the 

30 other participants that are connected (provided that 
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said latecomer has the minimum equipment necessary for 
taking part in a videoconf erence which has the 
previously described modalities and features) . 

Advantageously, in the case of an Internet lirik'- 
5 up, thanks to besides normal switch or ISDN telephone 
lines, the connection between the remote user and the 
provider can be carried out by dint of a mixed signal 
management system where the requests of the user are 
transmitted to the provider down the telephone lines, 
10 wkereas the audio video signal of the videoconf erence 
or of the data which have been required can be received 
via satellite, leading to a drastic improvement of 
quality and increasing the speed of reception 
regardless of the traffic on the network and of the 
15 amount of users connected to it at that very moment. 

Furthermore, using the Internet, it is possible to 
carry out transmission and data file exchange, 
regardless of the type of data therein contained, in a 
manner which is absolutely compatible with any type of 
20 computer or computer system. 

Said remote or neighbour locations 2 may also 
comprise a camera and a microphone which are apt to 
send the audiovisual signal from a certain event like a 
parade or a sports match, to direction room 1 that is 
25 going to manage it in the most suitable manner. 

According to the present invention, it is possible 
to conduct even very '"intense" debates between a 
limited number of participants, avoiding frequent image 
changes; this is accomplished by subdividing the screen 
30 into adjacent windows and enabling the audio of the 
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entire discussion group. In this case there are found 
to be shown only those who are part of said restricted 
group of people on the screen and at the same time. 

It is useful to notice that using CODECS, it is 
5 possible to control remote cameras based at locations 
2. This means that the staff in the direction room is 
capable of showing or zooming details at their own 
discretion, by sending suitable directions that are 
bound to be executed by the camera located at the 
10 user's location. 

In particular cases, it is finally possible to 
envisage link-ups between direction room 1 and the 
users exclusively via satellite. 

The present invention can also be applied to other 
15 fields such as: conferences, training and refresher 
courses, sales, advertising, consultancy services, 
tourism and others . 

The present invention has been described and 
illustrated according to one preferred embodiment, but 
20 it holds that whoever skilled in the art may well amend 
or change it without stepping out of the scope of the 
present patent. 
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